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DOCUMENT-IDENTIFIER: US 6647383 Bl 

TITLE: System and method for providing interactive dialogue and iterative search 
functions to find information 



Brief Summary Text (5) : 

A problem with the currently available search engines is that they fail to provide 
effective results for many search problems. The failures fall into a number of 
categories including, but not limited to, 1) formulating a query, 2) adequately 
displaying, manipulating and navigating through results, 3) determining for what 
the user is actually looking, and 4) remembering how to locate the results again. ■ 
From the point of view of web site owners, search engines have another set of 
shortcomings including, but not limited to 1) failure of metatags to provide 
sufficient information for an intelligent search approach, 2) failure of key words 
to formulate an intelligent search, and 3) the inherent requirement that web site 
owners maintain a large number of tags in inventory because of the ambiguity of 
language. Additionally, there exist some technical entities that can be improved 
upon in the course of a search. For example, it would be advantageous if redundant 
results could be eliminated, dead ends could be eliminated, and sources could be 
evaluated, tagged and screened. It would also be advantageous to provide search 
functions that correspond with known communities of users (or websites, etc.). 

Brief Summary Text (7) : 

Web search engines do not provide user access to restructure aspects of the search 
from a graphical user interface. When a search is conducted and results are' 
displayed, the decisions of the search engine are not displayed such that the user 
can manipulate the branches and navigate down the decision and results tree, 
changing the attributes and thereby finding slightly different results. The user is 
not provided with any information on what the extent of the results may be. The 
user is not afforded any opportunity to reconfigure the search or the results to 
display the relationship among the items returned. 

Brief Summary Text (25)-: 

Another advantage of the present invention is the determining of COI categories in 
fine granularity, representing COI categories and representing them differently for 
different COI's, representing relationships among COI categories and identifying an 
individual and the COI or COI's to which he or she belongs. 

Detailed Description Text (25) : 

That is, when text is input on a form, or in a search input field, it can be sent 
to the search engine. If there are thousands of results, and the search server 
determines that the data returned are spread over a number of subjects, sources, 
etc., than the word or phrase typed into the input field of the search can be 
checked to see if it has multiple meanings, has special meanings in a dictionary of 
special word usage associated with a specialty or COI, or if the catalogue of 
expert knowledge has the word or phrase catalogued . The record in the COI or expert 
database could be used to select likely elements that would be of interest to the 
user. 

Detailed Description Text (31) : 

Back-end 20 includes a front-end interface 28 for communicating search information 
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to and from front-end 18. A Smart Search engine 30 is provided that comprises 
functions, policies, and retrieval capability for identifying, locating and 
retrieving elements across a network of servers, as will be appreciated by those 
skilled in the art. Also provided are a DB (database) retrieval/searching component 
32 for requesting DB information, a DB engine 34 for performing all physical DB 
retrieval and update functions, a DB administration component 36 for performing DB 
administration functions requested by administrate^ 38, a web crawler 40 which 
scans the internet 16 searching for information to store and index in Information 
Database 42, and a DB build/ update component 44 that receives information from web 
crawler 40 and, in turn, uses DB engine 34 to build and update database 42. Other 
components, optionally used by Smart Search engine 30, are Lexical Dictionary 46, 
Expert Knowledge Database 48, Personality Profile Catalogue 50 and source profile 
catalogue 52. A data mining engine 54 is provided for building and maintaining 
information stores in relevant databases. A learning engine 56 is provided for 
building and maintaining the Expert Knowledge Database 48, the Personality Profile 
Catalogue 50 and a Display Techniques Database 58. Provisioning applications 60 are 
provided for provisioning users and relevant databases. A Community of Interest 
(COI) database 62 is provided for matching and assignment to user 12. More detailed 
functionality of Smart Search system 10 is provided in the following description. 

Detailed Description Text (37) : 

With reference to FIG. 4, page-by-page GUI sequence generator 24 also includes an 
iterative process 130 to disambiguate words. As shown, a word or phrase is supplied 
from user 12 at entry. point 132 or from a completed spell check operation 100 at 
entry point 134 and process 136 determines if the word has multiple definitions in 
the dictionaries 46. Process 136 will perform a standard dictionary lookup 138 to 
find alternate definitions 1 (140) and 2 (142) . A user prompt 144 is provided to 
user 12 to request whether definition 1, 2, other (not specified) or other 
(specified) should be selected. If definition 1 or definition 2 is selected, 
processing continues at 146. If other (not specified) is selected, a determination 
is made on whether it is a new word at 148, and the system is taught about the new 
word at 150 before continuing. If other (specified) is selected, the search is 
conducted using records from searches conducted by other members of the COI at 152 
as described, for example, in the Shriver and Small method described above, the 
patent application for which is incorporated herein by reference. Process 136 will 
also check, at node 154, profile catalogue 50 for a COI to which user 12 belongs. 
Process 156 checks the word or phrase in dictionaries and databases corresponding 
with selected COIs. From the results of process 156, process 158 returns alternate 
definition 3 for search purposes along with a database entry of known features 
corresponding to definition 3. Process 152 then conducts a search using alternate 
definition 3. 

Detailed Description Text (38) : 

As shown in FIG. 5, page-by-page GUI sequence generator 24 further includes a 
process 170 for handling cases wherein a search has returned a large number of 
elements, or rather a number exceeding a preset threshold. Data classification 
engine 54, in backend 20, is invoked to analyze the data as a first step 172. This 
will attempt to place elements into feature clusters or groups. This process can 
involve accessing information about the elements derived from previous searches by 
all users, searches by users in the same COI, and any of the databases mentioned in 
this patent. A simple example might entail a broad clustering of the domains from 
which each element was drawn based on access patterns by members of different 
COI 1 s . To minimize the amount of computation necessary per search session, the Data 
classification engine is expected to perform some or all of its learning functions 
offline . 

Detailed Description Text (39) : 

After the Data classification engine has analyzed the data, a decision 174 is made 
to determine whether there is a minimal number of groups or not. If the number is 
minimal, processing continues at node 176 and data is displayed by display function 
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178. The data can be displayed as a view of all high-level data 180, or 
alternately, a check 182 of the user profile can be performed (in Profile Catalog 
50), to determine how, at step 184, to display the data. For example, the display 
might comprise a table with rows and columns determined by data features. The 
information derived by the Data classification engine will also help inform the 
display process. 

Detailed Description Text (40) : 

If the number of groups is not minimal, it can be large. If so, processing 
continues at node 186. A database check 188 is performed on the Source 
Characterization Database 52 and the Expert Knowledge Database 48, looking for the 
groups that are important. If a group is interesting to a single COI, processing 
continues at node 190, and a view function 192 shows all data with features 
preferred by the COI based on references to Profile Catalogue 50, Source 
Characterization Database 52, and Knowledge Database 48. 

Detailed Description Text (48) : 

Guide to experts — The user interface provides a means to pull up a listing of 
members of each Community of Interest. Users identify when they want their identity 
to remain secured. Otherwise, members of the user group can seek the names of 
people or organizations having the same interests or expertise they need. The 
Personality Profile Catalogue 50 is used to create the listing of experts and 
interested persons. 

Detailed Description Text (51) : 

Data display function — The data display application uses the Personality Profile 
Catalogue 50 to identify the user's COI. There is a corresponding generic profile 
of Display types in the Display Techniques Database 58. These profiles of display 
techniques are available to be used when little is known about the preferences of 
the user. If the user is continually presented with a type of display and chooses 
another one from the toolbar into which the resulting elements may be transferred, 
then the profile is updated to reflect the preference. As the population of users 
in the same COI continue to use the display functions, the system will learn more 
about the group as a population and will ultimately learn how to create a better 
Display Techniques Database entry for that COI. There may be refined 
differentiation among user groups observed in this manner and the databases will 
ultimately contain many more finely differentiated databases of Personality 
Profiles and Display Technique preferences. 

Detailed Description Text (53) : 

"Tell me more about this" function — Once search results have been returned, the 
application has more data about the search and its resulting elements. It knows 
what the sources are, etc. The user may wish to learn more about things the way 
they can in the library. Once a person gets to a bookshelf or magazine rack in a 
library, many items related to the original search are present. These items may not 
have been explicitly listed in the search results but they are of potential 
interest to the user. The graphical user interface permits the user to continue a 
search by selecting one branch of the search and continuing in a more refined 
manner into the data that are related to the original search. Convenient methods of 
performing this function include using the Source Characterization Database 52 to 
categorize the results and preferably provide a link to results titled "Would you 
like more magazine articles about pedigreed cats?" The user could then access the 
information if desired. The Community of Interest is also used to prioritize the - 
questions asked of the user. The Expert Knowledge Database 48 maintains and updates 
the query formulation function. 

Detailed Description Text (56) : 

Community of Interest database 62 contains provisioned and learned information 
about COIs. It contains histories of searches conducted by members of COIs . 
Provisioned data includes pre-populated information including catalogues of known 
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professions, hobbies, demographic groups, etc . It contains references to lexical 
dictionaries, sources preferred, user profiles, display techniques, etc. It also 
contains references to Source Characterization Databases. 

Detailed Description Text (58) : 

To effectively provide service to new users, and those identified for the first 
time with a COI, a pre-populated global description can be used. Library lists and 
Indexes of Job Classifications may be used to begin the database. This will assist 
in filling in the spaces of COI's. Looking at the sources available online and the 
descriptions of the target audiences will also provide a piece of the COI database. 

Detailed Description Text (59) : 

Once users of the system collect around. each COI, the basic database 
representations may be modified to reflect the measurable requirements of the 
actual members of the group. The database will contain job categories, population 
categories, demographic information, population segmentations, memberships of known 
organizations and groups, hobbies and interests categories, etc. 

Detailed Description Text (64) : 
Personality Profile Catalogue 

Detailed Description Text (65) : 

With reference back to FIG. 1, Personality Profile Catalogue 50, besides containing 
profiles unique to individuals, contains personality profiles for celebrities and 
generic profiles for miscellaneous use or, perhaps, for use as templates for 
defining initial individual profiles. It should be noted that Personality Profile 
Catalogue '50 and other databases are not limited to the above-mentioned exemplary • 
personality profiles. 

Detailed Description Text (66) : 

Creating a COI profile in profile catalogue 50 for a user with or without 
registration by user would be done by: assessing technical papers, web site, email, 
subscriptions (including source characteristics), mail lists to and from, items 
forwarded, organization charts, directory entries, query entries of the individual 
user 12 to determine special interests and clustering of features in the elements 
identified (a selection of these and other items can be assessed depending on 
access provided through group service arrangement, or individual subscription) ; 
borrowing a personality from a generic group similar to the individual then making 
changes as more information becomes available; assessing selected items in search 
results relating to an area of interest to determine characteristics and 
associating them with the user's, profile; determining an age or social context of 
the individual; determining groups and individuals associated with the individual 
by reviewing email, technical papers, telephone and email .address book, post, 
articles containing individual's name or member of group, etc.; determining an 
individual user's relationship to the group: casual, involved, etc.'; recording 
multiple COI's of the individual using the system; determining attributes relating 
to understanding and use of language including but not limited to: primary language 
used, other languages used, competence and performance in each; and, determining 
attributes of language for each COI including but not limited to: access to 
vocabulary of each specialty, estimated sophistication in subject or area of 
interest, facility with vocabulary in specialty (competence and performance) . 

Detailed Description Text (67) : 

In addition, preferably there exists a database of catalogues of sources used in 
research, in a library, for market research, etc. The items in the catalogues, 
(books, journals, etc.) have known target audiences, and actual audiences. The 
audiences can change over time, and that is something we can observe by looking at 
the users selections online and updating the database. Over time there will be more 
fine granularity in the specialization of the sources. (Perhaps because online, 
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magazines and journals may not be tracked in their entirety, but may be counted by 
the individual articles, ads, etc.) There are temporal facts about elements, 
people, etc. that will make one item more desirable than another. For example, a 
research cardiologist might only be interested in items that are published within 
the past two years. When looking for a conference, he or she is only interested in 
conference listings that are recent, and that will occur at least one month in the 
future (for planning purposes). A student 8.sup.th grader may not be that sensitive 
to the calendar items. There is a different calendar that 1 s. important to a student. 
School year, holidays, time of day, etc. There may be some reason to select 
information or display information based upon the calendar people live by. 

Detailed Description Text (68) : 

As such, creating a proxy profile in profile catalogue 50 for a COI would be done 
by: identifying sources that, fit the COI (an additional tier of sources should 
exist to provide flexibility) ; and, identifying a calendar for relevant items that 
best fits the COI (a secondary calendar should exist) . 

Detailed Description Text (69) : 

FIG. 10 provides a flow chart of a Profile Provisioning for Search Function 280. A 
test 282 is first performed to determine if the user is known. In case test 282 
returns a yes answer, yes path 284 is invoked where test 286 determines how the 
user is known. Possible answers to test 286 are, for example, a company database, a 
social group/organization database, public database/directory, school government 
database, found online (email, web site, document, papers, etc.), other private, 
other public, and has profile. Once it is determined how the user is known, 
processing continues at node 288 to populate profile catalogue 50. 

Detailed Description Text (70) : 

Returning to test 282, if a no answer is returned, no path 290 is invoked where 
function 292 offers the user the following profile options: provide profile 
information, select a generic profile, select a celebrity profile, or bypass the 
profile option. If any choice other than the bypass option is returned by the user 
at 294, processing continues at node 288 to populate profile catalogue 50. Bypass 
path 296 is invoked if the user chooses to bypass the profile option, where 
collection function 298 gathers keyboard/user interface entries and items selected 
from a results listing. Compare function 300 compares the items gathered by the 
previous collection function with known COIs. Select function 302 then selects the 
closest COI for addition to profile catalogue 50. 

Detailed Description Text (71) : 

Once it is determined that Personality Profile Catalogue 50 needs to be populated 
or updated from the above-described procedures, the task is reduced to determining 
what to put into Personality Profile Catalogue 50. FIG. 11 shows a flow chart for a 
portion of process 310 to make this determination, and this determination is made 
depending on the source of information. Process 312 handles the case where the 
source is a social group/organization database. 

Detailed Description Text (72) : 

Catalogue function 314 catalogues the social group, relationships and organization 
into profile database 50. Identify function 316 identifies the COI of the group 
including subcategories for populating profile database 50. Scan function 318 scans 
documents, web sites, etc. for words and terms to populate profile database 50. 
Scan function 320 scans email for subjects, senders, etc. to populate profile 
database 50. Scan function 322 scans and processes text files to create a group 
profile. It creates a lexicon of terms used by the group or correlates the lexicon 
from known sources such as organization/sub j ect dictionaries. Create function 324 
creates a geographic representation of the organization/group, locations of people, 
etc. to populate profile database 50. 

Detailed Description Text (73) : 



http://westbrs:9000^in/gate.exe?f^doc&state=94dl86J9.5&ESNAME 7/29/05 



Record Display Form 



Page 6 of 9 



FIG. 12 shows a flow chart for process 330 which handles the case where the source 
is found online based on the user's ID or name. Decision function 332 determines 
whether the source is a personal web site, directory, document, etc. If the source 
is a personal web site, scan function 334 scans the web site for clusters of terms 
used. Correlation function 336 correlates the clusters with one or more COIs. 
Matching function 338 chooses the best fit COIs, and populate function 340 
populates profile catalogue 50 with a template of that COI or COIs. If decision 
function 332 determines that the source is an online directory, scan function 342 
scans the directory for features, geography, organization charts, etc. Select 
function 344 selects a path to a generic profile with identifying data and 
geography or communication, if known. If decision function 332 determines that the 
source is an online document, scan function 346 scans the document and determines 
if an author is known. If an author is known, path 348 is traversed, and weight 
function 350 weighs the document high because the author is known. Check function 
352 attempts to determine the publisher source, and matching function 354 
determines a best fit COI. If the author is unknown, however, content match 
function 356 is invoked, and check function 358 searches for word clusters of 
interest. 

Detailed Description Text (76) : 

FIG. 15 shows a flow chart for the final process 380, of determination process 310, 
which handles the case where the source is a company database. Function 382 
references organization charts, relationships and departments. Function 384 
determines a job title, and if a COI matches the title directly, populates profile 
catalogue 50 with the COI. Function 386 references the job description, and 
performs cross-referencing of intelligent databases, and catalogues information if 
necessary for populating profile catalogue 50. Function 388 references' any 
referenced web sites and scans those web sites for clustering categories and 
clusters of text contents for determining COIs. Function 390 references emails and 
handles these with appropriate scanning functions, scanning for content, senders 
information and subjects. Function 392 references documents databases and scans for 
features and clusters within those databases. Function 394 references 
communications links such as, for example, email, phone, fax, to determine features 
and clusters for correlating to COIs. And finally, function 396 references 
geographic locations for home, office, etc. for populating profile catalogue 50. 

Detailed Description Text (78) : 

Source Characterization Database 52 contains catalogues of sources, target 
audiences, demographics, publishers, useful features, cross references, COI 
references, provisioned and learned, market research, dates, frequencies of updates 
and types of source (electronic, broadcast, video, personal web site, etc.). 

Detailed Description Text (79) : 

Creating a proxy source catalogue in characteristics catalogue 52 would be done by: 
identifying known subject matter sources including but not limited to journals, 
newspapers, magazines, web sites, brick and mortar sources, paper published 
sources, etc.; and, populating the catalogue 52 with descriptions, links-, profile 
of language, type of data available in each source, availability online. 

Detailed Description Text (80) : 

Thus, resources on the network of networks can be catalogued by labeling each known 
resource (journals, periodicals, publications, organizational publications, 
professional publications, newspapers, personal sites, business sites, catalogues, 
library resources, market research, etc.), identifying COI ' s interested in each 
known resource, correlating vocabulary used in known resources with COI,* and 
periodically checking the vocabulary and update it with new buzz words, jargon, 
etc . 

Detailed Description Text (93) : 

The personality profile catalogue or database 50, as noted above, is a database 
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that can identify characteristics of the user that will assist in formulating a 
query, determining which items in a list the user may be interested in, selecting a 
display technique, determining which sources to prioritize, etc. Problems faced in 
using the strategy of COI for search include being able to identify the COI of the 
user without collecting specific information from the user. The system can store a 
series of default profiles, then observe user behavior, query user preferences, and 
modify the profile for future searches. 

Detailed Description Text (101) : 

Writing style in the sources selected by user 12 or by an expert will be used to 
identify characteristics of the' COI. These characteristics will be catalogued by 
Smart Search system 10. Writing style of the user 12 will be characterized and 
stored in a personality profile 50 database. 

Detailed Description Text ' (103) : 

Search results sources will be catalogued to identify characteristics of the 
sources including but not limited to: dates and frequency, COI, COI 1 s, general 
level, degree and level of jargon, categories of resources (calendars of events, 
classified, technology reviews, etc.) and will be used to inform user about 
sources. The source characteristics catalogue 52 will record whether the source is 
a primary source, secondary source, web site, commercial publication, technical, 
business, fiction, or other type of source. Experts will annotate the source. 
Source participants can add information to the record about the source. 

Detailed Description Text (104): 

One method of presenting results is to list categories in a series of nested menu 
windows based upon the data returned and how it clusters in categories, and 
metacategories . Category names can be derived from the data and then recorded for 
future use by the Smart Search system 10 application. Categories can also be 
derived from the data with the supervision of an expert resource person, or subject 
matter expert. Category differentiation is based upon fine-grained understanding 
and differentiation among terms, phrases, reserved words, jargon, technical terms, 
sources selected, and resulting elements. 

Detailed Description Text (105) : 

The complete results presentation will contain representations of each distinct 
concept represented in the results and not constrained by COI, and a reference to 
each data point in. each concept representation. Since many elements in a search 
result list belong to more than one category or abstract concept, the relationship 
among categories will be presented. 

Detailed Description Text (107) : 

When sources are identified by the Smart Search system 10 application, annotation 
from the cataloguing process will be made available. The annotation will inform the 
user 12 about the source and about who created it and who (COI) uses the source. 
This information may be* provided by the source owner, an expert, or automatically 
determined. For certain applications Smart Search system 10 will also link to the 
main information center for the source identified. 

Detailed Description Text (115) : 

Another means for altering. the direction of the search application is to add 
clarification by looking at the data represented in the results. By selecting other 
features of the data to display relationships not clearly illustrated in the first 
display of the data, a search result may reveal new information to the user. The 
results data is representing the view from a particular point of view or COI . The 
user is not limited to this presentation of the data. Other features identified in 
the search are available and may be selected from a window, or by another means of 
manipulating the data of the results. A helpful device would be a window that lists 
all the recognized significant data relationships in the search results. For 
example, if 20% of the search results were from technical journals, and 15% of the 
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search results were from items dating within the past ten years, those facts would 
be listed in the window for the user to see. The data itself reveals this 
information, using the database stored in the search application server to verify 
sources, publication dates, subjects and relevance, etc. Attributes about the users 
of the source are also relevant to the COI . If a description of the users of a 
source is made available, then the COI can be matched more directly with the 
targeted market of the sources. Even if the results are displayed by categories 
assumed to be important to the COI, there may be a feature in the data that is of 
interest to the user and is not graphically represented. By using a method such as 
the window mentioned above, a user may identify a feature that is of interest, and 
request that the display be modified to illustrate that feature. 

Detailed Description Text (116) : 

Example: Theater Directions Magazine Monthly 20,000 Circulation US Published in 
English Specialized periodical for Theater professionals Ages of readers 
Demographics of readers Additional information about readers 35 Articles monthly 
Information about Authors 200 Advertisers Information about advertisers Personals 
Classifieds Etc. 

Detailed Description Text (118): 

Search Application: Prefers Theater Directions Magazine as a source for search 
conducted by a member of the Thespian COI. A search may result in a listing of 
results spanning 3 major unrelated COI 1 s . For example, a search on Home Networking 
may result in thousands of items in a list spanning many pages of titles. When 
looking at the results, they may fall into the categories of: home communications 
network architectures including such subjects as computer LAN's, etc., home health 
aides visiting the ill or elderly, and home school programs using a network of 
teachers and other resources. By using the strategy associated with Communities of 
Interest, the system would identify which major subject area was related to a 
particular COI through: knowledge of the sources of materials, subject handled by 
the source, types of authors, types of content, disambiguating words used for the 
search and filtering for features related to the COI, etc. Without specific 
knowledge of the COI, the total of the search results would be segregated based 
upon strata and subject as defined in the global COI database. The results would 
then be displayed showing the features that differentiate them, in this case being 
home electronics/communications, social work, education, and other. 

Detailed Description Text (121) : 

Humans are particularly good at identifying patterns in data when the data is 
presented in an appropriate graphical form. The key is to identify reasonable 
graphical representations for abstract data elements. It is another object of the 
present invention to identify subject matter and object classifications that will 
be modeled graphically so search results may be presented to the user 12 . An 
example for representing solutions graphically would involve using a metaphor for 
placing pushpins representing search returned objects at a high level and with 
relationships described graphically (between the returned objects) . Color can be 
used to indicate proximity characteristics. Sound can be used to illustrate 
relationships among items, and to represent navigational distances between objects. 
Sounds can enhance an application so audio-based interfaces may be used in addition 
to graphical. For example, a search might be conducted using a cell phone, and the 
user 12 will listen to results receiving audio cues to help formulate relationships 
among returned items. For example, suppose a consumer is searching for a particular 
car.. The consumer knows some attributes relating to the car, such as the 
manufacturer and the year. The Smart Search will enable the consumer user 12 to use 
a configuration management approach to filling in a form through which the search 
will be formulated. The assumption here is that the consumer's shopping interests 
can vary along a continuum from a very directed search through to a rather 
unconstrained browsing type of search. An important issue relating to these 
dimensions of searching is the ability for a search engine to have varying degrees 
of information presented during the query and still return useful, targeted, and 
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understandable results. 
CLAIMS : 

2. The method as set forth in claim 1 further comprising: representing interest 
categories and representing each differently for different interests; representing 
relationships among the categories ; and, identifying an individual and the interest 
categories to which he or she belongs. 

4. The method as set forth in claim 1 further comprising creating, updating and 
accessing personality profile records in a personality profile catalogue . 

10. The method as set forth in claim 1 further comprising iterating using 
configurable query techniques, using a lexical dictionary for disambiguation of 
query text. 

12. The system as set forth in claim 11 further comprising at least one database 
for storing information representing interest categories, representing 
relationships among the categories and identifying an individual and the interest 
categories to which he or she belongs. 

14. The system as set forth in claim 11 further comprising a personality profile 
catalogue . 

17. The system as set forth in claim 11 further comprising a Community of Interest 
Database for storing the interest categories . 

20. The system as set forth in claim 11 wherein the interface facilitates 
communication of configurable query techniques, using a lexical dictionary for 
disambiguation of query text. 

Previous Doc Next Doc Go to Doc# 
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ASSISTANT -EXAMINER : Alam; Shanid 

ATT Y-AGENT- FIRM: The Hecker Law Group 

ABSTRACT : 

A method and apparatus for improving access to literature is described. Embodiments 
of the invention comprise an access mechanism that searches current and past 
literature (e.g., journal publications or other articles) and selects some or all 
of the literature for a user based on criteria established for the user. In one 
embodiment of the invention, the access mechanism is coupled to an electronic 
medical records system used to enter patient information and user profile 
information and coupled to one or more literature (e.g., medical, scientific, 
current affairs, law, dental, etc.) libraries or database (s). Search criteria is 
obtained from user profile information established for a physician (or other user 
of the electronic medical records system) and patient information. The search 
criteria is used to generate a request for literature from the libraries. The 
search criteria may act as a filter of the literature that is contained in a 
library. In addition, the results of the request obtained from a library may be 
filtered based on criteria established for the user. The user may view, save, 
and/or print the results. 

40 Claims, 16 Drawing figures 
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ABSTRACT: 



A high-speed document retrieval system creates a regular expression dictionary and 
a word index on the basis of a retrieval document and a word dictionary to conduct 
retrieval to a document through the regular, expression dictionary and the word 
index at a high speed. A regular expression dictionary expressing a set of 
character strings having the same length is created from a word dictionary . In 
terms of a character string included in a retrieval document and matching with a 
regular expression in the regular expression dictionary, an index element is 
recorded in a word index when there is no different index element .which allows an 
observing index element to be deducible, which eventually produces a word index 
capable of achieving a high-speed full-text retrieval without the noticeable 
increase in the index capacity. 
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ABSTRACT : 

An information retrieval system providing for cross-lingual information retrieval. 
The method as implemented in the system causes elementary words to be extracted 
from a search expression. These elementary words are translated and stemmed into 
the target language (s). Searches are then made for documents containing 
combinations of the stemmed and translated elementary words. When documents are 
retrieved they are verified that they contain the same linguistic structure as the 
initial search expression. Documents that cannot be verified are eliminated from 
the set of search documents returned to the user. 
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ABSTRACT : 

A database accessing system for processing a request" to access a database including 
a multiplicity of entries, each entry including at least one word, the request 
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system including a similar word finder operative, for at least one interpretation 
of each representation, to find at least one database word which is at least 
similar to that interpretation, and a database entry evaluator operative, for each 
database word found by the similar word finder; to assign similarity values for 
relevant entries in the database, said values representing the degree of similarity 
between each database entry and the request. 
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ABSTRACT: 
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accordance with the received selection instruction. 
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A system and method for information searching comprising determination of, in fine 
granularity, a Community of Interest (COI), further data mining in search results, 
using at least one of COI and expert preferences to identify important knowledge, 
formulation and manipulation of results, and summarization of search results into a 
document like entity with dynamic attributes described. More particularly, the 
invention relates to a system and method for providing interactive dialogue and 
iterative search functions to find information on a large network of servers such 
as the world wide web. 
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ABSTRACT : 

A general analysis system for performing an analysis used in the control of an 
industrial process. The general analysis system comprises a general analysis 
program for performing at least one analysis on a set of data and an analysis 
control data for controlling the analysis of the data set. According to the present 
invention, the general analysis program comprises a plurality of analysis 
primitives and the analysis control data comprises a plurality of analysis modules. 
An analysis may be constructed from a set of analysis modules and performed by 
interpreting this set using at least one of the analysis primitives. 
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ABSTRACT : 



A natural-language processing system such as a machine-translation system employs a 
tree structure of increasingly specialized system dictionaries and attaches user 
dictionaries to individual system dictionaries in the tree, or helps users edit 
their user dictionaries by displaying lists of unknown words encountered in 
translations, or uploads processing programs such as translation engines to a 
dictionary server to make dictionary access more efficient, or combines a source 
document and a machine translation thereof into a single document in such a way 
that the reader of the translation can conveniently see the original source text, 
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Brief Summary Text (5) : 

A problem with the currently available search engines is that they fail to provide 
effective results for many search problems. The failures fall into a number of 
categories including, but not limited to, 1) formulating a query, 2) adequately 
displaying, manipulating and navigating through results, 3) determining for what 
the user is actually looking, and 4) remembering how to locate the results again. 
From the point of view of web site owners, search engines have another set of 
shortcomings including, but not limited to 1) failure of metatags to provide 
sufficient information for an intelligent search approach, 2) failure of key words 
to formulate an intelligent search, and 3) the inherent requirement that web site 
owners maintain a large number of tags in inventory because of the ambiguity of 
language. Additionally, there exist some technical entities that can be improved 
upon in the course of a search. For example, it would be advantageous if redundant 
results could be eliminated, dead ends could be eliminated, and sources could be 
evaluated, tagged and screened. It would also be advantageous to provide search 
functions that correspond with known communities of users (or websites, etc.). 

Brief Summary Text (7) : 

Web search engines do not provide user access to restructure aspects of the search 
from a graphical user interface. When a search is conducted and results are 
displayed, the decisions of the search engine are not displayed such that the user 
can manipulate the branches and navigate down the decision and results tree, 
changing the attributes and thereby finding slightly different results. The user is 
not provided with any information on what the extent of the results may be. The 
user is not afforded any opportunity to reconfigure the search or the results to 
display the relationship among the items returned. 

Brief Summary Text (25) : 

Another advantage of the present invention is the determining of COI categories in 
fine granularity, representing COI categories and representing them differently for 
different COI's, representing relationships among COI categories and identifying an 
individual and the COI or COI's to which he or she belongs. 

Detailed Description Text (25) : 

That is, when text is input on a form, or in a search input field, it can be sent 
to the search engine. If there are thousands of results, and the search server 
determines that the data returned are spread over a number of subjects, sources, 
etc., than the word or phrase typed into the input field of the search can be 
checked to see if it has multiple meanings, has special meanings in a dictionary of 
special word usage associated with a specialty or COI, or if the catalogue of 
expert knowledge has the word or phrase catalogued . The record in the COI or expert 
database could be used to select likely elements that would be of interest to the 
user . 

Detailed Description Text (31): 

Back-end 20 includes a front-end interface 28 for communicating search information 
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to and from front-end 18. A Smart Search engine 30 is provided that comprises 
functions, policies, and retrieval capability for identifying, locating and 
retrieving elements across a network of servers, as will be appreciated by those 
skilled in the art. Also provided are a DB (database) retrieval/searching component 
32 for requesting DB information, a DB engine 34 for performing all physical DB 
retrieval and update functions, a DB administration component 36 for performing DB 
administration functions requested by administrator 38, a web crawler 40 which 
scans the internet 16 searching for information to store and index in Information 
Database 42, and a DB build/update component 44 that receives information from web 
crawler 40 and, in turn, uses DB engine 34 to build and update database 42. Other 
components, optionally used by Smart Search engine 30, are Lexical Dictionary 46, 
Expert Knowledge Database 48, Personality Profile Catalogue 50 and source profile 
catalogue 52. A data mining engine 54 is provided for building and maintaining 
information stores in relevant databases. A learning engine 56 is provided for 
building and maintaining the Expert Knowledge Database 48, the Personality Profile 
Catalogue 50 and a Display Techniques Database 58. Provisioning applications 60 are 
provided for provisioning users and relevant databases. A Community of Interest 
(COI) database 62 is provided for matching and assignment to user 12. More detailed 
functionality of Smart Search system 10 is provided in the following description. 

Detailed Description Text (37): 

With reference to FIG. 4, page-by-page GUI sequence generator 24 also includes an 
iterative process 130 to disambiguate words. As shown, a word or phrase is supplied 
from user 12 at entry point 132 or from a completed spell check operation 100 at 
entry point 134 and process 136 determines if the word has multiple definitions in 
the dictionaries 46. Process 136 will perform a standard dictionary lookup 138 to 
find alternate definitions 1 (140) and 2 (142) . A user prompt 144 is provided to 
user 12 to request whether definition 1, 2, other (not specified) or other 
(specified) should be selected. If definition 1 or definition 2 is selected, 
processing continues at 146. If other (not specified) is selected, a determination 
is made on whether it is a new word at 148, and the system is taught about the new 
word at 150 before continuing. If other (specified) is selected, the search is 
conducted using records from searches conducted by other members of the COI at 152 
as described, for example, in the Shriver and Small method described above, the 
patent application for which is incorporated herein by reference. Process 136 will 
also check, at node 154, profile catalogue 50 for a COI to which user 12 belongs. 
Process 156 checks the word or phrase in dictionaries and databases corresponding 
with selected COIs. From the results of process 156, process 158 returns alternate 
definition 3 for search purposes along with a database entry of known features 
corresponding to definition 3. Process 152 then conducts a search using alternate 
definition 3. 

Detailed Description Text (38) : 

As shown in FIG. 5, page-by-page GUI sequence generator 24 further includes a 
process 17 0 for handling cases wherein a search has returned a large number of 
elements, or rather a number exceeding a preset threshold. Data classification 
engine 54, in backend 20, is invoked to analyze the data as a first step 172. This 
will attempt to place elements into feature clusters or groups. This process can 
involve accessing information about the elements derived from previous searches by 
all users, searches by users in the same COI, and any of the databases mentioned in 
this patent. A simple example might entail a broad clustering of the domains from 
which each element was drawn based on access patterns by members of different 
COI's. To minimize the amount of computation necessary per search session, the Data 
classification engine is expected to perform some or all of its learning functions 
offline . 

Detailed Description Text (39) : 

After the Data classification engine has analyzed the data, a decision 174 is made 
to determine whether there is a minimal number of groups or not. If the number is 
minimal, processing continues at node 176 and data is displayed by display function 
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178. The data can be displayed as a view of all high-level data 180, or 
alternately, a check 182 of the user profile can be performed (in Profile Catalog 
50), to determine how, at step 184, to display the data. For example, the display 
might comprise a table with rows and columns determined by data features. The 
information derived by the Data classification engine will also help inform the 
display process. 

Detailed Description Text (40) : 

If the number of groups is not minimal, it can be large. If so, processing 
continues at node 186. A database check 188 is performed on the Source 
Characterization Database 52 and the Expert Knowledge Database 48, looking for the 
groups that are important. If a group is interesting to a single COI, processing 
continues at node 190, and a view function 192 shows all data with features 
preferred by the COI based on references to Profile Catalogue 50, Source 
Characterization Database 52, and Knowledge Database 48. 

Detailed Description Text (48) : 

Guide to experts — The user interface provides a means to pull up a listing of 
members of each Community of Interest. Users identify when they want their identity 
to remain secured. Otherwise, members of the user group can seek the names of 
people or organizations having the same interests or expertise they need. The 
Personality Profile Catalogue 50 is used to create the listing of experts and 
interested persons. 

Detailed Description Text (51) : 

Data display function — The data display application uses the Personality Profile 
Catalogue 50 to identify the user's COI. There is a corresponding generic profile 
of Display types in the Display Techniques Database 58. These profiles of display 
techniques are available to be used when little is known about the preferences of 
the user. If the user is continually presented with a type of display and chooses 
another one from the toolbar into which the resulting elements may be transferred, 
then the profile is updated to reflect the preference. As the population of users 
in the same COI continue to use the display functions, the system will learn more 
about the group as a population and will ultimately learn how to create a better 
Display Techniques Database entry for that COI. There may be refined 
differentiation among user groups observed in this manner and the databases will 
ultimately contain many more finely differentiated databases of Personality 
Profiles and Display Technique preferences. 

Detailed Description Text (53) : 

"Tell me more about this" function — Once search results have been returned, the 
application has more data about the search and its resulting elements. It knows 
what the sources are, etc. The user may wish to learn more about things the way 
they can in the library. Once a person gets to a bookshelf or magazine rack in a 
library, many items related to the original search are present. These items may not 
have been explicitly listed in the search results but they are of potential 
interest to the user. The graphical user interface permits the user to continue a 
search by selecting one branch of the search and continuing in a more refined 
manner into the data that are related to the original search. Convenient methods of 
performing this function include using the Source Characterization Database 52 to 
categorize the results and preferably provide a link to results titled "Would you 
like more magazine articles about pedigreed cats?" The user could then access the 
information if desired. The Community of Interest is also used to prioritize the 
questions asked of the user. The Expert Knowledge Database 48 maintains and updates 
the query formulation function. 

Detailed Description Text (56) : 

Community of Interest database 62 contains provisioned and learned information 
about COIs. It contains histories of searches conducted by members of COIs. 
Provisioned data includes pre-populated information including catalogues of known 
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professions, hobbies, demographic groups, etc. It contains references to lexical 
dictionaries, sources preferred, user profiles, display techniques, etc. It also 
contains references to Source Characterization Databases. 

Detailed Description Text (58) : 

To effectively provide service to new users, and those identified for the first 
time with a COI, a pre-populated global description can be used. Library lists and 
Indexes of Job Classifications may be used to begin the database. This will assist 
in filling in the spaces of COI's. Looking at the sources available online and the 
descriptions of the target audiences will also provide a piece of the COI database. 

Detailed Description Text (59) : 

Once users of the system collect around each COI, the basic database 
representations may be modified to reflect the measurable requirements of the 
actual members of the group. The database will contain job categories, population 
categories, demographic information, population segmentations, memberships of known 
organizations and groups, hobbies and interests categories, etc. 

Detailed Description Text (64) : 
Personality Profile Catalogue 

Detailed Description Text (65) : 

With reference back to FIG. 1, Personality Profile Catalogue 50, besides containing 
profiles unique to individuals, contains personality profiles for celebrities and 
generic profiles for miscellaneous use or, perhaps, for use as templates for 
defining initial individual profiles. It should be noted that Personality Profile 
Catalogue 50 and other databases are not limited to the above-mentioned exemplary 
personality profiles. 

Detailed Description Text (66) : 

Creating a COI profile in profile catalogue 50 for a user with or without 
registration by user would be done by: assessing technical papers, web site, email, 
subscriptions (including source characteristics) , mail lists to and from, items 
forwarded, organization charts, directory entries, query entries of the individual 
user 12 to determine special interests and clustering of features in the elements 
identified (a selection of these and other items can be assessed depending on 
access provided through group service arrangement, or individual subscription) ; 
borrowing a personality from a generic group similar to the individual then making 
changes as more information becomes available; assessing selected items in search 
results relating to an area of interest to determine characteristics and 
associating them with the user's profile; determining an age or social context of 
the individual; determining groups and individuals associated with the individual 
by reviewing email, technical papers, telephone and email address book, post, 
articles containing individual's name or member of group, etc.; determining an 
individual user's relationship to the group: casual, involved, etc.; recording 
multiple COI 1 s of the individual using the system; determining attributes relating 
to understanding and use of language including but not limited to: primary language 
used, other languages used, competence and performance in each; and, determining 
attributes of language for each COI including but not limited to: access to 
vocabulary of each specialty, estimated sophistication in subject or area of 
interest, facility with vocabulary in specialty (competence and performance) . 

Detailed Description Text (67): 

In addition, preferably there exists a database of catalogues of sources used in 
research, in a library, for market research, etc. The items in the catalogues, 
(books, journals, etc.) have known target audiences, and actual audiences. The 
audiences can change over time, and that is something we can observe by looking at 
the users selections online and updating the database. Over time there will be more 
fine granularity in the specialization of the sources. (Perhaps because online, 
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magazines and journals may not be tracked in their entirety, but may be counted by 
the individual articles, ads, etc.) There are temporal facts about elements, 
people, etc. that will make one item more desirable than another. For example, a 
research cardiologist might only be interested in items that are published within 
the past two years. When looking for a conference, he or she is only interested in 
conference listings that are recent, and that will occur at least one month in the 
future (for planning purposes). A student 8.sup.th grader may not be that sensitive 
to the calendar items. There is a different calendar that's important to a student. 
School year, holidays, time of day, etc. There may be some reason to select 
information or display information based upon the calendar people live by. 

Detailed Description Text (68) : 

As such, creating a proxy profile in profile catalogue 50 for a COI would be done 
by: identifying sources that fit the COI (an additional tier of sources should 
exist to provide flexibility) ; and, identifying a calendar for relevant items that 
best fits the COI (a secondary calendar should exist) . 

Detailed Description Text (69) : 

FIG. 10 provides a flow chart of a Profile Provisioning for Search Function 280. A 
test 282 is first performed to determine if the user is known. In case test 282 
returns a yes answer, yes path 284 is invoked where test 286 determines how the 
user is known. Possible answers to test 286 are, for example, a company database, a 
social group/organization database, public database/directory, school government 
database, found online (email, web site, document, papers, etc.), other private, 
other public, and has profile. Once it is determined how the user is known, 
processing continues at node 288 to populate profile catalogue 50. 

Detailed Description Text (70) : 

Returning to test 282, if a no answer is returned, no path 290 is invoked where 
function 292 offers the user the following profile options: provide profile 
information, select a generic profile, select a celebrity profile, or bypass the 
profile option. If any choice other than the bypass option is returned by the user 
at 294, processing continues at node 288 to populate profile catalogue 50. Bypass 
path 2 96 is invoked if the user chooses to bypass the profile option, where 
collection function 2 98 gathers keyboard/user interface entries and items selected 
from a results listing. Compare function 300 compares the items gathered by the 
previous collection function with known COIs. Select function 302 then selects the 
closest COI for addition to profile catalogue 50. 

Detailed Description Text (71) : 

Once it is determined that Personality Profile Catalogue 50 needs to be populated 
or updated from the above-described procedures, the task is reduced to determining 
what to put into Personality Profile Catalogue 50. FIG. 11 shows a flow chart for a 
portion of process 310 to make this determination, and this determination is made 
depending on the source of information. Process 312 handles the case where the 
source is a social group/organization database. 

Detailed Description Text (72) : 

Catalogue function 314 catalogues the social group, relationships and organization 
into profile database 50. Identify function 316 identifies the COI of the group 
including subcategories for populating profile database 50. Scan function 318 scans 
documents, web sites, etc. for words and terms to populate profile database 50. 
Scan function 32 0 scans email for subjects, senders, etc. to populate profile 
database 50. Scan function 322 scans and processes text files to create a group 
profile. It creates a lexicon of terms used by the group or correlates the lexicon 
from known sources such as organization/subject dictionaries. Create function 324 
creates a geographic representation of the organization/group, locations of people, 
etc. to populate profile database 50. 

Detailed Description Text (73) : 
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FIG. 12 shows a flow chart for process 330 which handles the case where the source 
is found online based on the user's ID or name. Decision function 332 determines 
whether the source is a personal web site, directory, document, etc. If the source 
is a personal web site, scan function 334 scans the web site for clusters of terms 
used. Correlation function 336 correlates the clusters with one or more COIs. 
Matching function 338 chooses the best fit COIs, and populate function 340 
populates profile catalogue 50 with a template of that COI or COIs. If decision 
function 332 determines that the source is an online directory, scan function 342 
scans the directory for features, geography, organization charts, etc. Select 
function 344 selects a path to a generic profile with identifying data and 
geography or communication, if known. If decision function 332 determines that the 
source is an online document, scan function 346 scans the document and determines 
if an author is known. If an author is known, path 348 is traversed, and weight 
function 350 weighs the document high because the author is known. Check function 
352 attempts to determine the publisher source, and matching function 354 
determines a best fit COI. If the author is unknown, however, content match 
function 356 is invoked, and check function 358 searches for word clusters of 
interest . 

Detailed Description Text (76) : 

FIG. 15 shows a flow chart for the final process 380, of determination process 310, 
which handles the case where the source is a company database. Function 382 
references organization charts, relationships and departments. Function 384 
determines a job title, and if a COI matches the title directly, populates profile 
catalogue 50 with the COI. Function 386 references the job description, and 
performs cross-referencing of intelligent databases, and catalogues information if 
necessary for populating profile catalogue 50. Function 388 references any 
referenced web sites and scans those web sites for clustering categories and 
clusters of text contents for determining COIs. Function 390 references emails and 
handles these with appropriate scanning functions, scanning for content, senders 
information and subjects. Function 392 references documents databases and scans for 
features and clusters within those databases. Function 394 references 
communications links such as, for example, email, phone, fax, to determine features 
and clusters for correlating to COIs. And finally, function 396 references 
geographic locations for home, office, etc. for populating profile catalogue 50. 

Detailed Description Text (78): 

Source Characterization Database 52 contains catalogues of sources, target 
audiences, demographics, publishers, useful features, cross references, COI 
references, provisioned and learned, market research, dates, frequencies of updates 
and types of source (electronic, broadcast, video, personal web site, etc.). 

Detailed Description Text (79) : 

Creating a proxy source catalogue in characteristics catalogue 52 would be done by: 
identifying known subject matter sources including but not limited to journals, 
newspapers, magazines, web sites, brick and mortar sources, paper published 
sources, etc.; and, populating the catalogue 52 with descriptions, links, profile 
of language, type of data available in each source, availability online. 

Detailed Description Text (80) : 

Thus, resources on the network of networks can be catalogued by labeling each known 
resource (journals, periodicals, publications, organizational publications, 
professional publications, newspapers, personal sites, business sites, catalogues, 
library resources, market research, etc.), identifying COI 1 s interested in each 
known resource, correlating vocabulary used in known resources with COI, and 
periodically checking the vocabulary and update it with new buzz words, jargon, 
etc . 

Detailed Description Text (93) : 

The personality profile catalogue or database 50, as noted above, is a database 
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that can identify characteristics of the user that will assist in formulating a 
query, determining which items in a list the user may be interested in, selecting a 
display technique, determining which sources to prioritize, etc. Problems faced in 
using the strategy of COI for search include being able to identify the COI of the 
user without collecting specific information from the user. The system can store a 
series of default profiles, then observe user behavior, query user preferences, and 
modify the profile for future searches. 

Detailed Description Text (101) : 

Writing style in the sources selected by user 12 or by an expert will be used to 
identify characteristics of the COI. These characteristics will be catalogued by 
Smart Search system 10. Writing style of the user 12 will be characterized and 
stored in a personality profile 50 database. 

Detailed Description Text (103) : 

Search results sources will be catalogued to identify characteristics of the 
sources including but not limited to: dates and frequency, COI, COI's, general 
level, degree and level of jargon, categories of resources (calendars of events, 
classified, technology reviews, etc.) and will be used to inform user about 
sources. The source characteristics catalogue 52 will record whether the source is 
a primary source, secondary source, web site, commercial publication, technical, 
business, fiction, or other type of source. Experts will annotate the source. 
Source participants can add information to the record about the source. 

Detailed Description Text (104) : 

One method of presenting results is to list categories in a series of nested menu 
windows based upon the data returned and how it clusters in categories, and 
metacategories. Category names can be derived from the data and then recorded for 
future use by the Smart Search system 10 application. Categories can also be 
derived from the data with the supervision of an expert resource person, or subject 
matter expert. Category differentiation is based upon fine-grained understanding 
and differentiation among terms, phrases, reserved words, jargon, technical terms, 
sources selected, and resulting elements. 

Detailed Description Text (105) : 

The complete results presentation will contain representations of each distinct 
concept represented in the results and not constrained by COI, and a reference to 
each data point in each concept representation. Since many elements in a search 
result list belong to more than one category or abstract concept, the relationship 
among categories will be presented. 

Detailed Description Text (107) : 

When sources are identified by the Smart Search system 10 application, annotation 
from the cataloguing process will be made available. The annotation will inform the 
user 12 about the source and about who created it and who (COI) uses the source. 
This information may be provided by the source owner, an expert, or automatically 
determined. For certain applications Smart Search system 10 will also link to the 
main information center for the source identified. 

Detailed Description Text (115) : 

Another means for altering the direction of the search application is to add 
clarification by looking at the data represented in the results. By selecting other 
features of the data to display relationships not clearly illustrated in the first 
display of the data, a search result may reveal new information to the user. The 
results data is representing the view from a particular point of view or COI . The 
user is not limited to this presentation of the data. Other features identified in 
the search are available and may be selected from a window, or by another means of 
manipulating the data of the results. A helpful device would be a window that lists 
all the recognized significant data relationships in the search results. For 
example, if 20% of the search results were from technical journals, and 15% of the 
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search results were from items dating within the past ten years, those facts would 
be listed in the window for the user to see. The data itself reveals this 
information, using the database stored in the search application server to verify 
sources, publication dates, subjects and relevance, etc. Attributes about the users 
of the source are also relevant to the COI. If a description of the users of a 
source is made available, then the COI can be matched more directly with the 
targeted market of the sources. Even if the results are displayed by categories 
assumed to be important to the COI, there may be a feature in the data that is of 
interest to the user and is not graphically represented. By using a method such as 
the window mentioned above, a user may identify a feature that is of interest, and 
request that the display be modified to illustrate that feature. 

Detailed Description Text (116) : 

Example: Theater Directions Magazine Monthly 20,000 Circulation US Published in 
English Specialized periodical for Theater professionals Ages of readers 
Demographics of readers Additional information about readers 35 Articles monthly 
Information about Authors 200 Advertisers Information about advertisers Personals 
Classifieds Etc. 

Detailed Description Text (118) : 

Search Application: Prefers Theater Directions Magazine as a source for search 
conducted by a member of the Thespian COI . A search may result in a listing of 
results spanning 3 major unrelated COI 1 s . For example, a search on Home Networking 
may result in thousands of items in a list spanning many pages of titles. When 
looking at the results, they may fall into the categories of: home communications 
network architectures including such subjects as computer LAN's, etc., home health 
aides visiting the ill or elderly, and home school programs using a network of. 
teachers and other resources. By using the strategy associated with Communities of 
Interest, the system would identify which major subject area was related to a 
particular COI through: knowledge of the sources of materials, subject handled by 
the source, types of authors, types of content, disambiguating words used for the 
search and filtering for features related to the COI, etc. Without specific 
knowledge of the COI, the total of the search results would be segregated based 
upon strata and subject as defined in the global COI database. The results would 
then be displayed showing the features that differentiate them, in this case being 
home electronics/communications, social work, education, and other. 

Detailed Description Text (121) : 

Humans are particularly good at identifying patterns in data when the data is 
presented in an appropriate graphical form. The key is to identify reasonable 
graphical representations for abstract data elements. It is another object of the 
present invention to identify subject matter and object classifications that will 
be modeled graphically so search results may be presented to the user 12. An 
example for representing solutions graphically would involve using a metaphor for 
placing pushpins representing search returned objects at a high level and with 
relationships described graphically (between the returned objects) . Color can be 
used to indicate proximity characteristics. Sound can be used to illustrate 
relationships among items , and to represent navigational distances between ob j ects . 
Sounds can enhance an application so audio-based interfaces may be used in addition 
to graphical. For example, a search might be conducted using a cell phone, and the 
user 12 will listen to results receiving audio cues to help formulate relationships 
among returned items. For example, suppose a consumer is searching for a particular 
car. The consumer knows some attributes relating to the car, such as the 
manufacturer and the year. The Smart Search will enable the consumer user 12 to use 
a configuration management approach to filling in a form through which the search 
will be formulated. The assumption here is that the consumer's shopping interests 
can vary along a continuum from a very directed search through to a rather 
unconstrained browsing type of search. An important issue relating to these 
dimensions of searching is the ability for a search engine to have varying degrees 
of information presented during the query and still return useful, targeted, and 
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understandable results . 



CLAIMS : 

2. The method as set forth in claim 1 further comprising: representing interest 
categories and representing each differently for different interests; representing 
relationships among the categories ; and, identifying an individual and the interest 
categories to which he or she belongs . 

4. The method as set forth in claim 1 further comprising creating, updating and 
accessing personality profile records in a personality profile catalogue . 

10. The method as set forth in claim 1 further comprising iterating using 
configurable query techniques, using a lexical dictionary for disambiguation of 
query text. 

12. The system as set forth in claim 11 further comprising at least one database 
for storing information representing interest categories, representing 
relationships among the categories and identifying an individual and the interest 
categories to which he or she belongs. 

14. The system as set forth in claim 11 further comprising a personality profile 
catalogue . 

17. The system as set forth in claim 11 further comprising a Community of Interest 
Database for storing the interest categories . 

20. The system as set forth in claim 11 wherein the interface facilitates 
communication of configurable query techniques, using a lexical dictionary for 
disambiguation of query text. 
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The disclosed invention is a system and method (collectively the "system") for the 
automated management of content on a network interface. The network interface can 
be a web site on the World Wide Web, an Internet location, an intranet location, an 
extranet location, or some other form of network interface (collectively "web 
site") . The system can automatically create applications and links to those 
applications without human intervention. Examples of. automated applications include 
newsroom applications, calendar of events, employment opportunities, project 
portfolio, biographies, frequently asked questions, document library, category 
management, product catalogs, e-mail broadcasts, surveys, and newsletters. Fully 
normalized hierarchies of business rules and user profiles can be supported by the 
system to facilitate automation and configurability. Multiple content providers can 
manage a single web site in a simultaneous or substantially simultaneous manner. In 
ASP embodiments, multiple organizations can use the system to manage multiple web 
sites in a substantially simultaneous manner. 
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ABSTRACT : 



A method for creating a rights data dictionary for definition of terms of a rights 
expression language is disclosed. The data dictionary may be used in computer 
operations on rights expression metadata, such as for digital rights management 
applications. The data dictionary may also be used for translating between 
expressions in different rights expression languages, thereby enabling 
interoperability of rights expressions. The rights data dictionary is organized 
according to a hierarchical schema. Terms are added to the data dictionary 
according to specified rules that relate to a basic term set, and in particular, 
are begotten from act type terms. The schema includes a "context" term of the basic 
term set that is assigned a meaning signifying a circumstance in which at least the 
basic action occurs. Other terms may be developed from the context term and defined 
by their relationship to it. 
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